# Low-resource inference

Diffucoder 7B Cpgrpo 4bit

DiffuCoder-7B-cpGRPO-4bit is a 4-bit quantized version converted from the Apple DiffuCoder-7B-cpGRPO model, optimized for the MLX framework.

Large Language Model Other

Kimi Dev 72B GGUF

A quantized version of Kimi-Dev-72B that uses nonlinear optimal quantization and a multi-head latent attention mechanism to reduce storage and compute requirements.

Large Language Model Other

Delta Vector Austral 24B Winton GGUF

A quantized version of Delta-Vector's Austral-24B-Winton model, produced with llama.cpp and suitable for efficient operation on a range of hardware configurations.

Large Language Model English

Qwen3 235B A22B 4bit DWQ 053125

This is a 4-bit quantized version converted from the Qwen3-235B-A22B-8bit model, optimized for the MLX framework and suitable for text generation tasks.

Large Language Model

Phantom Wan 1.3B GGUF

This is a project that directly converts bytedance-research/Phantom to the GGUF format for image-to-video conversion tasks.

Text-to-Video English

Deepseek R1 0528 Qwen3 8B MLX 8bit

An 8-bit quantized version based on the DeepSeek-R1-0528-Qwen3-8B model, optimized for Apple Silicon chips and suitable for text generation tasks.

Large Language Model

lmstudio-community

Llama 3.3 70b Instruct Deepseek Distilled GGUF

A multilingual text generation model fine-tuned based on unsloth/Llama-3.3-70B-Instruct-bnb-4bit, supporting English, Spanish, Latin, Arabic, and French.

Large Language Model

Transformers Supports Multiple Languages

Dans PersonalityEngine V1.3.0 24b Q4 K M GGUF

A multilingual text generation model based on Mistral-Small-3.1-24B-Base-2503, supporting 10 languages, suitable for role-playing and dialogue scenarios

Large Language Model

Gemma 3 1b It Fast GUFF

A quantized version optimized for low-end hardware and CPU-only environments, intended for production-ready inference under resource constraints.

Large Language Model

Bielik 4.5B V3.0 Instruct GGUF

Bielik-4.5B-v3.0-Instruct-GGUF is a Polish large language model released by SpeakLeash, converted from Bielik-4.5B-v3.0-Instruct to GGUF quantized format, suitable for local inference.

Large Language Model Other

Nousresearch DeepHermes 3 Llama 3 3B Preview GGUF

An instruction fine-tuned model based on the Llama-3-3B architecture, supporting tasks such as dialogue, reasoning, and role-playing, suitable for general artificial intelligence assistance scenarios.

Large Language Model English

Llama 3 8B Instruct Abliterated TR

An abliterated version of Llama-3-8B-Instruct, modified to force the model to respond in Turkish.

Large Language Model

Transformers Other

Zero Mistral 24B Gguf

Zero-Mistral-24B is a large language model based on the Mistral architecture, supporting Russian and English, suitable for dialogue and text generation tasks.

Large Language Model Supports Multiple Languages

Orpheus 3b Kaya Q8 0.gguf

An 8-bit quantized text-to-speech model fine-tuned from Canopy Labs' pre-trained model, supporting 24kHz English audio generation

Speech Synthesis Supports Multiple Languages

Google Gemma 3 27b It Qat GGUF

A quantized version based on Google Gemma 3's 27-billion parameter instruction-tuned model, generated using quantization-aware training (QAT) weights, supporting multiple quantization levels to meet different hardware requirements.

Large Language Model

Gemma 3 12b It GPTQ 4b 128g

This model is an INT4 quantized version of google/gemma-3-12b-it. It uses the GPTQ algorithm to reduce weight precision from 16-bit to 4-bit, significantly decreasing disk space and GPU memory requirements.
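The saving from going 16-bit to 4-bit can be estimated with simple arithmetic. The sketch below is illustrative only: the parameter count of 12 billion is a round figure taken from the model name, and the per-group overhead (one 16-bit scale and one 4-bit zero-point per group of 128 weights, suggested by the "128g" suffix) is an assumption about the storage layout, not a statement about this particular checkpoint.

```python
def weight_bytes(n_params: int, bits_per_weight: float) -> float:
    """Approximate storage for the weights alone, in bytes."""
    return n_params * bits_per_weight / 8


def grouped_bits_per_weight(weight_bits: int, group_size: int,
                            scale_bits: int = 16, zero_bits: int = 4) -> float:
    """Effective bits per weight when each group of weights shares one
    scale and one zero-point (assumed overhead, see the note above)."""
    return weight_bits + (scale_bits + zero_bits) / group_size


# Hypothetical round parameter count, used only for the estimate.
n_params = 12_000_000_000

fp16_bytes = weight_bytes(n_params, 16)
int4_bytes = weight_bytes(n_params, grouped_bits_per_weight(4, 128))

print(f"16-bit weights: {fp16_bytes / 1e9:.1f} GB")
print(f"4-bit weights (group size 128): {int4_bytes / 1e9:.2f} GB")
print(f"reduction: {fp16_bytes / int4_bytes:.2f}x")
```

Activations, the KV cache, and any layers left unquantized are not counted, so real memory use will be higher than this estimate.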

3b Hi Ft Research Release Q4 K M GGUF

This is a GGUF format model converted from the canopylabs/3b-hi-ft-research_release model, supporting Hindi text processing.

Large Language Model Other

Turkish Llama 3 8B Function Calling GGUF

A Turkish function-calling model fine-tuned from Turkish-Llama-8b-DPO-v0.1, designed for function-calling tasks in Turkish.

Large Language Model

Transformers Supports Multiple Languages

Huihui Ai Gemma 3 1b It Abliterated GGUF

This is a quantized version of an abliterated Google Gemma 3 1B instruction-tuned model, produced with llama.cpp and suitable for running in resource-limited environments.

Large Language Model

This is the 4-bit quantized version of the Qwen/QwQ-32B model, optimized using the BitsAndBytes library, suitable for text generation tasks in resource-constrained environments.

Large Language Model

Transformers English

An RWKV-7 g1 model based on the Flash Linear Attention mechanism, with multilingual support and extended reasoning ("deep thinking") ability.

Large Language Model

Transformers Supports Multiple Languages

MS3 RP Broth 24B

An intermediate model during the Tantum merging process, created by merging multiple 24B-parameter Mistral and Llama3 variants, suitable for role-playing and text generation tasks.

Large Language Model

Transformers English

Thor V2.5 8b FANTASY FICTION 128K Q4 K M GGUF

This is a GGUF-format converted 8B-parameter language model specialized for fantasy fiction, supporting 128K context length.

Large Language Model English

Llasa 1B Q8 0 GGUF

This model is converted from HKUST-Audio/Llasa-1B into GGUF format, primarily designed for text-to-speech tasks.

Speech Synthesis Supports Multiple Languages

A hybrid model based on multiple 12B parameter models, specializing in Russian and English role-playing and text generation

Large Language Model

Llama3 8B 1.58 100B Tokens

A large language model fine-tuned with the BitNet b1.58 architecture, using Llama-3-8B-Instruct as the base model and applying extreme (1.58-bit) quantization.

Large Language Model

Bielik 11B V2.3 Instruct GGUF

This is the GGUF quantized version of the Polish large language model Bielik-11B-v2.3-Instruct developed by SpeakLeash, suitable for local deployment and use.

Large Language Model

Phi 3 Mini 4k Instruct Q4 K M GGUF

This model was converted from microsoft/Phi-3-mini-4k-instruct to GGUF format using llama.cpp via ggml.ai's GGUF-my-repo space.

Large Language Model Supports Multiple Languages

Llama 3.1 Storm 8B

Llama-3.1-Storm-8B is a model built on Llama-3.1-8B-Instruct that aims to improve the dialogue and function-calling capabilities of 8-billion-parameter models.

Large Language Model

Transformers Supports Multiple Languages

Cere Llama 3.1 8B Tr

A fine-tuned version of the Llama3.1 8B large language model optimized for Turkish, trained on high-quality Turkish instruction datasets

Large Language Model

Transformers Other

Gemma 2 27b It Q8 0 GGUF

This is a GGUF-format model converted from Google's Gemma 2 27B instruction-tuned model, suitable for text generation tasks.

Large Language Model

Bitnet B1 58 Xl Q8 0 Gguf

BitNet b1.58 is a large language model with 1.58-bit quantization. It reduces the computational resource requirements by lowering the weight precision while maintaining performance close to that of a full-precision model.
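As a rough illustration of what "1.58-bit" means: each weight takes one of three values (-1, 0, +1), and log2(3) is about 1.58 bits. The sketch below follows the absmean rounding scheme as described in the BitNet b1.58 paper, applied to a toy list of weights. The function name `absmean_ternary` and the sample values are made up for this example, and this is not the code used to produce the model listed here.

```python
import math


def absmean_ternary(weights, eps=1e-8):
    """Scale weights by their mean absolute value, then round and clip
    each one to the ternary set {-1, 0, +1}."""
    scale = sum(abs(w) for w in weights) / len(weights) + eps
    quantized = [max(-1, min(1, round(w / scale))) for w in weights]
    return quantized, scale


# Toy weights, chosen only to show all three output values.
weights = [0.9, -0.05, 0.4, -1.2, 0.02]
q, scale = absmean_ternary(weights)

print(q)             # [1, 0, 1, -1, 0]
print(math.log2(3))  # information per ternary weight, about 1.585 bits
```

Multiplying the ternary values by `scale` gives a coarse approximation of the original weights. In the paper the model is trained with this quantization in the loop rather than quantized afterwards.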

Large Language Model

Cere Llama 3 8b Tr

A fine-tuned version of the Llama3 8b large language model optimized for Turkish, trained on high-quality Turkish instruction datasets

Large Language Model

Transformers Other

An Italian large language model optimized based on Meta-Llama-3-8B, supporting English and Italian text generation tasks

Large Language Model

Transformers Supports Multiple Languages

Llama 3 8B Instruct GPTQ 4 Bit

This is a 4-bit quantized GPTQ model based on Meta Llama 3, quantized by Astronomer, capable of efficient operation on low-VRAM devices.

Large Language Model

Distil Whisper Small Cantonese

This is a distilled Cantonese speech recognition model based on Whisper Small, achieving a CER of 9.7 (without punctuation) on Common Voice 16.0.

Speech Recognition

Transformers Chinese

Indic Gemma 2b Finetuned Sft Navarasa 2.0

Multilingual instruction model fine-tuned on Gemma-2b, supporting 15 Indian languages and English

Large Language Model

Transformers Supports Multiple Languages

Telugu-LLM-Labs

Yugo55A-GPT is a Serbian-optimized large language model merged from multiple excellent models, demonstrating outstanding performance in Serbian LLM evaluations.

Large Language Model

Transformers Other

Minueza 32M Base

Minueza-32M-Base is a base model with 32 million parameters, fully trained on extensive English text corpora, suitable for text generation tasks.

Large Language Model

Transformers English

Law LLM 13B GGUF

Law LLM 13B is a domain-specific foundation model built on LLaMA-1-13B, focused on legal-domain tasks.

Large Language Model

Transformers English


© 2025 AIbase